In search of the unknown user: indexing, hypertext and the world wide web

نویسندگان

David Ellis

Nigel Ford

Jonathan Furner

چکیده

For the purposes of this article, the indexing of information is interpreted as the pre-processing of information in order to enable its retrieval. This definition thus spans a dimension extending from classification-based approaches (pre-co-ordinate) to keyword searching (post-co-ordinate). In the first section we clarify our use of terminology, by briefly describing a framework for modelling IR systems in terms of sets of objects, relationships and functions. In the following three sections, we discuss the application of indexing functions to document collections of three specific types: (1) ‘conventional’ text databases; (2) hypertext databases; and (3) the World Wide Web, globally distributed across the Internet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...

متن کامل

The Hidden Web

World Wide Web by browsing hypertext documents has led to the development and deployment of various search engines and indexing techniques. However, many information-gathering tasks are better handled by finding a referral to a human expert rather than by simply interacting with online information sources. A personal referral allows a user to judge the quality of the information he or she is re...

متن کامل

Structural Abstractions of Hypertext Documents for Web-Based Retrieval

There have been connicting views in the literature on the capability of tools and mechanisms for storing and accessing information in Internet. On one hand it has been criticized for a long time that World Wide Web ooers a chaotic environment for Web agents to extract information because the description of a document by HTML is friendly for humans to understand, but is not so to machines. On ot...

متن کامل

Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics

This paper discusses about the future of the World Wide Web development, called Semantic Web. Undoubtedly, Web service is one of the most important services on the Internet, which has had the greatest impact on the generalization of the Internet in human societies. Internet penetration has been an effective factor in growth of the volume of information on the Web. The massive growth of informat...

متن کامل

Anchor point indexing in Web document retrieval

Traditional World Wide Web search engines, such as AltaVista.com, index and recommend individual Web pages to assist users in locating relevant documents. As the Web grows, however, the number of matching pages increases at a tremendous rate. Users are often overwhelmed by the large answer set recommended by the search engines. Also, if a matching document is a hypertext, the document structure...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Journal of Documentation

دوره 54 شماره

صفحات -

تاریخ انتشار 1998

In search of the unknown user: indexing, hypertext and the world wide web

نویسندگان

چکیده

منابع مشابه

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

The Hidden Web

Structural Abstractions of Hypertext Documents for Web-Based Retrieval

Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics

Anchor point indexing in Web document retrieval

عنوان ژورنال:

اشتراک گذاری